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ABSTRACT 

During the Laser Interferometer Gravitational-wave Observatory and Virgo joint science runs in 2009-2010, 
gravitational wave (GW) data from three interferometer detectors were analyzed within minutes to select GW 
candidate events and infer their apparent sky positions. Target coordinates were transmitted to several telescopes 
for follow-up observations aimed at the detection of an associated optical transient. Images were obtained for 
eight such GW candidates. We present the methods used to analyze the image data as well as the transient search 
results. No optical transient was identified with a convincing association with any of these candidates, and none of 
the GW triggers showed strong evidence for being astrophysical in nature. We compare the sensitivities of these 
observations to several model light curves from possible sources of interest, and discuss prospects for future joint 
GW-optical observations of this type. 

Key words: binaries: close - catalogs - gravitational waves - stars: neutron - surveys 
Online-only material: color figures 


1. INTRODUCTION 

Transient gravitational-wave (GW) emission is expected 
from highly energetic astrophysical events such as stellar-core 
collapses and mergers of binary neutron stars (NSs). The Laser 
Interferometer Gravitational- wave Observatory (LIGO; Abbott 
et al. 2009; Harry et al. 2010) includes detectors located in the 
United States near Hanford, Washington (HI) and Livingston, 
LA (LI). A similarly designed Virgo (VI; Accadia et al. 2012; 
Virgo Collaboration 2009) detector is located in Italy near 
the city of Cascina. Each interferometer contains a pair of 
perpendicular arms, 4 km long in the LIGO detectors and 3 km 
in Virgo, whose effective optical path length is slightly altered 
by passing GW signals. Since 2007, LIGO and Virgo have 
coordinated operations and shared data, so the three sites operate 
as a single network of detectors seeking direct measurements 
of GW signals. A fourth site, GEO600 in Hannover, Germany 
(Grote et al. 2008), also shares data with LIGO and Virgo. 

During the 2009-2010 science run of the LIGO/ Virgo 
network (Abadie et al. 2012e) we implemented low-latency 
searches for GW transients. The analysis software identified 
GW event candidates (“triggers”), estimated their statistical 
significance, and reconstructed likely source positions in ap- 
proximately 10 minutes. Alert messages were transmitted to a 
network of electromagnetic observatories after the manual val- 
idation of the GW triggers with a total latency of ~30 minutes. 
The collection of optical telescopes, as well as the Swift satel- 
lite, LOFAR, and the Expanded Very Large Array (Lazio et al. 
2012), provided target of opportunity follow-up observations to 
the GW triggers. In earlier publications, we described the search 
method and likely sources of both GW and emission measure 
(EM) transients (Abadie et al. 2012c, 2012d), as well as the 
results of the follow-up observations performed with the Swift 
satellite (Evans et al. 2012). 

In this paper, we describe the data set collected with optical 
telescopes, detail the methods used to search the data for 
transients consistent with expected optical counterparts to GWs, 
and report the results of this analysis. In this first effort to 
use optical instruments to search for transients based on data 
from GW detectors, none of the GW triggers showed strong 
evidence for being astrophysical in nature. However, searching 
for transients in a large sky area is a challenging problem, and 
uncertainty in the expected light curve and spectrum of the 
sought optical counterpart makes the problem harder still. For 
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this reason, we emphasize the methodologies used to identify 
transient phenomena in our data set and to separate objects 
consistent with our target models from those that are not. In 
addition, we discuss the results of Monte Carlo simulations used 
to test the efficiency of our pipelines in recovering various types 
of transients, and the implications for future searches of optical 
counterparts of GW events discovered with next generation 
observatories. 

A variety of astrophysical processes are likely to be associ- 
ated with both GW and EM emission. Among these, gamma- 
ray bursts (GRBs) are promising sources for joint GW and EM 
studies (e.g., Kochanek & Piran 1993; Kobayashi & Meszaros 
2003; Abadie et al. 2012b). GRBs are traditionally divided in 
two main classes, long and short bursts (Kouveliotou et al. 
1993), which are thought to be associated with different pro- 
genitors (e.g., Gehrels et al. 2007; Meszaros 2006, and refer- 
ences therein). Long GRBs are associated with “collapsars,” 
the gravitational collapse of cores of massive stars (Woosley 
1993; MacFadyen & Woosley 1999), while short GRBs may 
be produced by mergers of binary systems of compact objects 
(NS/NS or black-hole/NS; e.g., Eichler et al. 1989; Paczynski 
1991; Narayan et al. 1992). A compact binary merger results 
from gravitational radiation, producing a characteristic “inspi- 
ral” of the binary orbit and a corresponding strong GW signal 
(e.g., Thorne 1987; Shibata & Taniguchi 2011). GW emission 
from a collapsar depends on non-spherically-symmetric flow of 
material during the collapse, which may be enhanced by cen- 
trifugal effects if the progenitor is rotating rapidly (Davies et al. 
2002; Fryer et al. 2002; Shibata et al. 2003; Piro & Pfahl 2007; 
Corsi & Meszaros 2009; Ott 2009; Romero et al. 2010). 

High-energy emission from GRBs is thought to escape as 
narrow relativistic jets (e.g., Sari et al. 1999; Harrison et al. 
1999; Frail et al. 2001; Racusin et al. 2009), though at least in 
the case of the short GRBs, there is uncertainty regarding the 
angular extent of typical beams (Fong et al. 2012), as well as 
how the beaming angle depends on wavelength (van Eerten & 
MacFadyen 2011). Afterglows of both classes of GRBs have 
been observed over a wide range of wavelengths (Costa et al. 
1997; Frail et al. 1997; van Paradijs et al. 1997; Gehrels et al. 
2005; Hjorth et al. 2005; Abdo et al. 2009), from times nearly 
concurrent with the prompt emission to days later (e.g., Nousek 
et al. 2006; Molinari et al. 2007; Racusin et al. 2011, and 
references therein). Generally, the observed optical afterglows 
fade with a temporal power-law decay, with typical indices 
between 1 and 1.5 (e.g., Sari et al. 1998; Nakar 2007). A wide 
range of luminosities have been observed, with the afterglows 
of short bursts tending to be less energetic than the afterglows 
of long bursts (Kann et al. 2011). 
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Table 1 

/?-band Light Curve Models Used for Simulated Injections 


Source 

Light Curve Model 


Normalization Condition 

Short GRB 
Long GRB 
Kilonova 

L oc t~ 1A 
L oc t~ lA 

L = (1.2 x 10 42 )f°' 43 ergs- 1 
L = (6.7 x 10 41 )?- 1 ' 29 ergs -1 

t < 0.7 days 
t > 0.7 days 

23-3 1 mag at 1 day from z = 1 
16-24 mag at 1 day from z = 1 


Notes. Normalizations used for the on-axis short GRB and long GRB models correspond to the full range of observed 
on-axis GRB afterglows in each class in the observer frame, assuming z = 1, from Kann et al. (2010, 2011). The kilonova 
model is intended to mimic the light curves shown in Metzger et al. (2010) and Piran et al. (2013). 


The merger of two NSs or a NS with a black hole may lead to a 
supernova-like transient, as described by Li & Paczynski (1998). 
In their model, heavy radioactive elements are formed in the 
merger ejecta through rapid neutron capture nucleosynthesis. As 
the newly formed isotopes decay toward stability, they release 
energy and heat the ejecta. Thermal emission becomes visible 
after the ejecta has expanded enough to allow photons to escape. 
The expected transient, referred to as a kilonova throughout this 
paper, is roughly isotropic, and the associated light curve is 
expected to peak about a day after the merger time (Metzger 
et al. 2010; Piran et al. 2013). The model has been supported by 
a variety of computational work (Faber & Rasio 2012; Roberts 
et al. 2011), though some details of the model are still uncertain, 
including the amount of mass ejected from the merger and 
the physics of the radiative transport. These unknowns lead 
to uncertainties in the peak luminosity, time-scale, and color 
evolution of the model. For example, Barnes & Kasen (2013) 
found that the ejected NS material may have a high opacity, 
leading to light curves that peak in infrared rather than optical 
wavelengths; this prediction seems consistent with one recent 
observation (Tanvir et al. 2013; Berger et al. 2013). For testing 
purposes, we adopted a simple model which was intended to 
mimic the main features of the light curves in Metzger et al. 
(2010) and Piran et al. (2013) (see Table 1). 

Core-collapse supernovae are expected to emit enough GW 
energy to be observable with current detectors within some 
fraction of the Milky Way, to distances of perhaps a few kpc (Ott 
2009). A rare class of core-collapse supernovae is also known to 
be linked to long GRBs (Galama et al. 1998; Woosley & Bloom 
2006; Soderberg et al. 2006). Indeed, optical follow-ups of 
GW triggers could catch optical supernovae harboring off-axis 
GRBs, whose gamma-ray emission would be missed because 
the relativistic GRB jet is not pointed toward earth (Granot 
et al. 2002; Rhoads 2003; van Eerten et al. 2010). However, 
unlike the models discussed above, tracking a supernova light 
curve requires several days or weeks of observations after the 
GW trigger (Doggett & Branch 1985). Slow light curves are 
also expected from off-axis GRBs, whose emission is expected 
to peak on timescales of weeks to months (e.g., van Eerten 
& MacFadyen 2011). Taking into account that the LIGO and 
Virgo detectors are expected to detect more merger events 
than core-collapse events, the cadence of our optical follow-up 
observations was chosen mainly for shorter optical transients, 
but with some observations extending to later times to possibly 
catch a slower transient. 

The paper is organized as follows: Section 2 first gives 
a description of the ground-based telescopes involved in the 
follow-up program. In Section 3, we present the set of GW 
triggers that were selected and sent as alerts to the telescopes and 
we describe their associated follow-up observations. Section 4 
details the methods employed to search for optical transients in 


the collected series of images and Section 5 reports the results 
of the searches. Finally, estimates of the search sensitivity are 
presented in Section 6. 


2. TELESCOPES INVOLVED IN THE 
FOLLOW-UP PROGRAM 

The optical follow-up program took place during times when 
the LIGO and Virgo observatories were operating in coincidence 
during 2009 and 2010. This time was divided into two segments: 
the “winter” run, between 2009 December and 2010 January, 
and the “autumn” run spanning most of 2010 September and 
October. The program was executed as a joint study between 
the LIGO and Virgo collaborations, and about 10 teams which 
operated automated and remotely controlled telescopes. 

During the winter run, triggers from the LIGO/ Virgo network 
were passed to the TAROT (Klotz et al. 2009) and QUEST 
(Baltay et al. 2007) telescopes. For the autumn run, the optical 
network was expanded to include Palomar Transient Factory 
(PTF; Rahmer et al. 2008; Law et al. 2009; Rau et al. 2009), Pi 
of the Sky (POTS; Malek et al. 2009), ROTSE III (Akerlof et al. 

2003) , SkyMapper (Keller et al. 2007), the Zadko Telescope 
(Coward et al. 2010), and the Liverpool Telescope (Steele et al. 

2004) . The large number (12) of telescopes participating in 
the autumn run allowed for better sky coverage. The main 
characteristics of these observatories are listed in Table 2. With 
the exception of the Liverpool RATCam and Zadko, they are 
all equipped with wide field cameras. A wide field of view 
(FOV) was considered an important feature for this study, due 
to the imprecise source localization of the GW instruments. We 
expected localizations of a few tens of square degrees up to 
200 deg 2 , and so instruments without a wide FOV would be 
unable to image a significant fraction of the uncertainty region 
(Cavalier et al. 2004; Nissanke et al. 2011; Fairhurst 2011; 
Klimenko etal.2011). However, with the limited sensitive range 
to an optimally aligned source (horizon distance) of initial LIGO 
and Virgo, it was also possible for an instrument to observe 
only the most likely host galaxies for a compact object merger 
(Abadie et al. 2012d; Kanner et al. 2008; Nuttall & Sutton 2010). 

Separate observing plans were constructed for each obser- 
vatory. Some of the instruments targeted only the single most 
likely field for a given GW trigger, while others observed mul- 
tiple fields in an effort to cover an area comparable to the GW 
position uncertainty (see Table 2). Planned cadences were also 
different for each observatory. Generally, the goal was to ob- 
serve at least once as quickly as possible to image a potential 
rapidly fading counterpart. Where possible, attempts were made 
to image each field over several nights following the GW trig- 
ger, in order to trace the light curves of potential transients. The 
details of the observations are described in Section 5. 
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Table 2 

Characteristics of Instruments Involved in the Search 


Name 

Locations 

FOV 

(deg 2 ) 

Aperture 

(m) 

Exposure Time 
(S) 

Limiting Magnitude 

Tiles 

Palomar Transient Factory 

i 

7.3 

1.2 

60 

20.5 

10 

Pi of the Sky 

i 

400 

0.072 

10 

11.5 

1 

QUEST 

i 

9.4 

1 

60 

20.5 

3 

ROTSE III 

4 

3.4 

0.45 

20 

17.5 

1 

SkyMapper 

1 

5.7 

1.35 

110 

21.5 

8 

TAROT 

2 

3.4 

0.25 

180 

17.5 

1 

Zadko Telescope 

1 

0.15 

1 

120 

20.5 

5 

Liverpool Telescope - RATCam 

1 

0.0058 

2 

300 

21 

1 

Liverpool Telescope - SkyCamZ 

1 

1 

0.2 

10 

18 

1 


Notes. The column labeled “Tiles” indicates the maximum number of different field positions that the telescope searched in response 
to a trigger. The shown limiting magnitudes are estimates, under ideal observing conditions. They are listed in r ' band for RATCam, 
r band for SkyMapper, and R band for all other instruments. Palomar Transient Factory, SkyMapper and RATCam are calibrated to the 
SDSS/AB photometric system; the others are expressed in the Vega photometric system. 


3. GRAVITATIONAL- WAVE TRIGGERS SELECTED 
FOR FOLLOW-UP OBSERVATIONS 

3.1. Trigger Selection 

Triggers for this search were identified with a collection of 
low-latency pipelines designed to find transient GW events in 
data from the three site LIGO/ Virgo network. Here, we provide 
a brief summary of the trigger production and selection, while a 
more detailed description is described in Abadie et al. (2012c, 
201 2d). During the winter run, two pipelines were used to 
identify generic short-duration transients of significant signal 
power, or “bursts,” and estimate their source positions: the 
Omega (£2) Pipeline (Searle et al. 2008; Abadie et al. 2010a) and 
the coherent WaveBurst (cWB) pipeline (Klimenko et al. 201 1). 
For the autumn run, a third trigger pipeline was added: the Multi- 
Band Template Analysis (MBTA; Beauville et al. 2008; Abadie 
et al. 2012c), which sought inspiral waveforms from coalescing 
compact objects. The autumn run also added a second instance 
of cWB, configured to target linearly polarized GW signals, as 
might be expected from supernovae. 

To compare triggers from different pipelines and identify the 
ones suitable for observation, follow-up software made event 
candidate selections based on the estimated false alarm rate 
(FAR) of each trigger. The rate of background false alarms 
was estimated by forming a distribution of artificial triggers 
from data with one or more data streams shifted by at least 
several seconds. Time-shifting data removes correlations of 
possible GW signals between detectors, so this distribution was 
considered to be free from any putative signals and represented 
the rate of triggers not due to transient GWs (Abadie et al. 
2012a, 2012e). During the winter run, a FAR threshold of 
1 trigger day' 1 was applied to triggers, and a less significant 
FAR was accepted in the last week to exercise the system. For the 
autumn run, the FAR threshold was set to 0.25 day” 1 . Triggers 
which passed the automated threshold received attention from 
an on-call follow-up team. The on-call team checked that the 
trigger occurred in high quality data in each interferometer. In 
addition, the criteria for manual validation in the winter run 
included demands that the three suggested (see below) QUEST 
fields covered a sky area corresponding to a greater than 50% 
probability of containing the GW source and that follow-up 
requests were sent at a rate of less than one per 24 hr. 

The trigger pipelines reported the estimated position of each 
candidate GW event as a skymap, a list of probability densities 


assigned to pixels in a grid covering the sky. The grid used 
pixels approximately 0.4 on a side, selected to be similar 
to the degree-scale resolving power of the GW network (for 
example, Fairhurst 2011; Klimenko et al. 2011; Vitale et al. 
2012; Nissanke et al. 2011). The large angular size of the 
skymaps required a choice of where within the uncertainty 
region to observe. To observe the regions most likely to contain 
an observable GW source, we used a catalog of galaxies within 
50 Mpc and Milky Way globular clusters (GWGC; White et al. 
2011), thought to be around 70% complete to 50 Mpc by 
/1-hand luminosity. Each pixel in the skymap was given a weight 
P according to the formula 


where L is the probability of the pixel derived from the GW 
data alone; M is the blue light luminosity of the galaxy or 
galaxies contained in the pixel, which is used as a proxy for 
the star formation rate; and D is the distance to the galaxy 
(Nuttall & Sutton 2010). For MBTA triggers, a slightly modified 
version of this approach was applied, using the maximum 
distance consistent with the apparent inspiral signal (Abadie 
et al. 2012c). The suggested fields for each telescope were those 
that maximized the sum of P within the respective FOV. Unless 
unobservable due to daylight or geometrical constraints, the 
suggested fields were passed to each optical telescope for every 
GW event candidate that passed manual validation. However, a 
more stringent selection was applied for PTF, and only one GW 
trigger was sent to PTF. 

3.2. Data Set 

In the winter run, the on-call team was alerted a total of 
nine times. Three of these triggers were vetoed by the on- 
call team. Six triggers were approved by the on-call team 
and sent to the QUEST and TAROT telescopes with roughly 
30 minutes of latency. Of the six requests, four were rejected 
as unobservable by the scheduling software of both telescopes 
and two triggers were followed-up with the QUEST telescope. 
In addition, two triggers that did not pass the automated FAR 
threshold were selected by the on-call team and passed to the 
partner observatories in an effort to expand the winter run data 
set (see Table 3). 

In the autumn run, only one trigger was manually rejected 
due to data quality concerns. Six triggers resulted in alerts 
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Table 3 





Gravitational Wave Triggers in the Winter Run 

ID 

Date 

UTC 

Pipeline 

FAR 

(day” 1 

Follow-up 

) 


G3821 

2009 Dec 29 

15:16:33 

Q 

0.66 

QUEST collected 1 2 images 


CWB1 

2010 Jan 3 

20:37:22 

cWB 

1.3 

Alert sent Jan 7; TAROT collected 6 images 


G4202 

2010 Jan 6 

06:49:45 

£2 

4.5 

QUEST collected 9 images 


CWB 2 

2010 Jan 7 

08:46:37 

cWB 

1.6 

QUEST collected 12 images 





Table 4 






Gravitational Wave Triggers in the Autumn Run 

ID 

Date 

UTC 

Pipeline 

FAR 


Follow-up 





(day -1 ) 



G19377 

2010 Sep 16 

06:42:23 

cWB (unmodeled) 

<0.01 

ROTSE collected 117 images, TAROT collected 20, Zadko 129, and 






SkyMapper 21 . Blind injection 

G20190 

2010 Sep 19 

12:02:25 

MBTA 

0.16 

ROTSE collected 257 images, QUEST 23, Zadko 159, and TAROT 3 

G21852 

2010 Sep 26 

20:24:32 

cWB (linear) 

0.02 

ROTSE collected 130 images, PTF 149, CAT 3 DQ 

G23004 

2010 Oct 3 

16:48:23 

£2 

0.21 

ROTSE collected 153 images, QUEST 40, Liverpool - RATCam 22, 






Liverpool - SkyCamZ 121, and POTS 444 


to the observing partners, four of which resulted in follow- 
up observations 147 (see Table 4). Two of the triggers are worth 
special note. The September 16 trigger was recognized by the 
on-call team as having a special significance: in addition to 
a small estimated FAR, spectrograms of the GW data revealed 
frequency evolution characteristic of the late inspiral and merger 
of two compact objects. This event was later revealed to be a 
blind hardware injection, a simulated signal secretly added to 
the data to test the end-to-end system. The September 26 event 
candidate was also discovered with a low FAR estimate. In 
subsequent GW data analysis, this trigger was found to be the 
most significant cWB trigger above 200 Hz in the time period 
where HI, LI, and VI were running in coincidence in this 
science run, though was removed from the analysis based on data 
quality concerns. The FAR was measured to be 0.023 events per 
day, or one such trigger expected for every 44 days of network 
livetime. Since these detectors ran in coincidence for a total of 
52.2 days throughout the Virgo science run, this trigger was 
consistent with expectations for detector noise. 

4. SEARCHES FOR OPTICAL TRANSIENTS 

A search for optical transients essentially consists of search- 
ing for fading optical point sources in a sequence of astronomical 
images. A few characteristics make the search for GW counter- 
parts unique. First, there is a significant uncertainty regarding 
the expected light curve from a GW source; we targeted short 
duration (hours to days) transients consistent with GRB after- 
glows and kilonovae light curves. Second, the poor localization 
of the GW error box required searching through a large portion 
of the sky. This significantly differed from the arcminute-scale 
error box used to find optical afterglows of GRBs discovered 
by Swift. Finally, we designed automated pipelines with Monte- 
Carlo simulations to evaluate the statistical significance of any 
apparent counterpart. 

The telescopes involved in the program included very differ- 
ent instruments ranging from shallow, very wide-held cameras 
to meter-class telescopes (Table 2). They collected images with 


147 Of the two triggers not observed, one was the first alert generated during 
the autumn run and ROTSE imaged the wrong location due to a software bug, 
while the other was too close to the Sun to be observable by any of the 
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different cadences and follow-up strategies, leading to a hetero- 
geneous data set. This has led us to develop a similarly hetero- 
geneous analysis approach, with techniques tailored to match 
the requirements of each observational data set. Where possi- 
ble, we leveraged existing software already in use by the various 
astronomical teams. The list of techniques which were applied 
in some, but not all, of the developed searches included image 
subtraction, identification of host galaxies, cuts on shape param- 
eters, automated transient classifiers, volunteer work by citizen 
scientists, and consistency checks on light curve properties. 

In future searches for optical counterparts to GW sources, a 
critical component will be rapidly down- selecting candidate lists 
to allocate follow-up resources such as large aperture photom- 
etry and spectroscopy. In this work, we attempted to unify re- 
sults from disparate analyses by developing two common search 
statistics, which were applied in multiple analyses. The first 
statistic was used to quantify the ability to reject false posi- 
tives, and labeled the “false-alarm probability” (FAP). The FAP 
was defined as the probability that a set of optical images taken 
with a given telescope in response to a single GW trigger, and 
analyzed with a given pipeline, would lead to a false positive. 
The FAP could encompass both false positives arising from 
technical noise, such as procedure artifacts, and astrophysical 
transients not related to the GW sources, such as M dwarf flares. 
Galactic variable stars, and extragalactic active galactic nuclei 
(AGNs) and supernovae. For most data sets, we set a FAP tar- 
get of 10%. This FAP level was chosen to reduce the number of 
false positives to a manageable level, so that each object passing 
the selection criteria could, in principle, be further studied with 
sensitive photometric and/or spectroscopic observations. The 
second statistic used to characterize an analysis was the detec- 
tion efficiency, defined as the recovery rate for simulated optical 
transients added to representative images. We measured detec- 
tion efficiencies for a few different model light curves, using 
data and analysis procedures from several different telescopes. 
The FAP measurements and the Monte Carlo simulations al- 
lowed us to find a good compromise between rejection of false 
positives and reduction of interesting EM candidates. For ex- 
ample, in a study with the QUEST and TAROT data, we found 
that increasing the FAP to 0.20 would produce less than a 30% 
improvement in the sensitive distance range of the search, and so 
would increase the sensitive search volume by roughly a factor 
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of two, while also doubling the number of false positives. This 
section describes the different methods that were used to iden- 
tify potential transients consistent with our models, and reduce 
false positives. 

4.1. Catalog-based Search for TAROT, 

Zadko, and QUEST Observations 

This section describes the image analysis pipeline developed 
specifically for the TAROT, Zadko Telescope, and QUEST 
observations. Unlike other approaches presented in this work, 
the pipeline did not use image subtraction but it extracted 
a source catalog from each image, and sought transients by 
comparing the set of catalogs to a reference. For this reason, we 
refer to this pipeline as the “catalog-based search.” 

4.1.1. Analysis Pipeline 

The search consisted of three main steps applied to the image 
set (after dark, flat and sky background level corrections): data 
photometric calibration, reconstruction of object light curves, 
and transient selection to identify possible electromagnetic 
counterparts. 

TAROT, the Zadko Telescope, and QUEST observed with a 
clear filter. The magnitude zero-point calibration was performed 
using the USNO-A2.0 catalog (Monet et al. 1998) as reference 
and resulted in red equivalent magnitudes. For the QUEST 
camera, which is composed of 1 12 individual CCDs, calibration 
was performed separately on each CCD. The different response, 
data quality, and sensitivity of each CCD prevented managing 
them as a single mosaic, and the data analysis was performed 
CCD by CCD. 

The source catalog of each image was extracted using 
SExtractor (Bertin & Arnouts 1996). Each list of sources 
was spatially cross-correlated with the star catalog USNO-A2.0 
using the tool match (Droege et al. 2006). The radius used to 
search for common sources was set to 10" for TAROT, 2" for 
Zadko, and 3" for QUEST. These values took into account the 
positional uncertainties in the images and in the USNO-A2.0 
catalog. Sources found to coincide in position and luminosity 
with objects listed in the reference catalog were excluded from 
the search. The lists of remaining sources were then mutually 
cross-correlated in position to link sources observed at different 
times to common astrophysical objects. This resulted in a light 
curve for each identified object. 

At this point, two types of analyses were conducted to select 
GW associated transients and reject background objects. The 
on-source analysis was restricted to objects lying in the image 
regions associated with galaxies within 50 M pc 1 ' ' and Galactic 
globular clusters. For each galaxy a circular region with a radius 
five times the galaxy’s semi-major axis (as provided by the 
GWGC; White et al. 2011) was analyzed. This region (which 
corresponds to an average radius of about 20 kpc) accounted for 
the typical projected physical offsets observed between GRB 
afterglows and their host galaxy centers (e.g., Berger 2010). The 
whole-field analysis covered the entire FOV but was limited to 
bright objects. For the QUEST telescope, large variations in 
the sensitivity and image quality between different CCDs made 
setting a whole-field magnitude threshold unfeasible to search 
the expected counterparts. For this reason, we performed only 
the on-source analysis on the QUEST data, which allowed us 
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to search for faint transients while limiting the number of false 
positives (see Section 4.1.2). 

For both types of analysis, rapid contaminating transients, 
including cosmic rays, asteroids, and CCD noise, were re- 
jected by requiring the presence of the object in a minimum 
number of consecutive images. Further selection of transient 
objects (and hence rejection of background) was performed 
by applying thresholds to the initial (first observation) mag- 
nitude and light curve variability of each source. Variability was 
characterized by assuming power-law luminosity dimming with 
time, £ oc V , corresponding to a linear magnitude variation 
m = 2.5/3 log 10 (f) + C. The slope index 2.5/3 was evaluated for 
each object. The expected slope indices for GRB afterglows and 
kilonova light curves are around 2.5-4 (see Table 1). To seek 
these transients, we applied a cut which selected slope indices 
greater than 0.5. Because of the small number of repeated ob- 
servations with QUEST (maximum of eight for each galaxy), a 
different variability measurement was used for this instrument’s 
analysis. A threshold on the flux variation between the first and 
the following nights of observation was set by requiring a dim- 
ming larger than +0.5 mag (while we expected >+l based on 
the light curve models and the QUEST observational cadence). 

Studies of the background events (Section 4. 1 .2) and the abil- 
ity to detect simulated on-axis GRBs and kilonovae (Section 6) 
were used to design selection criteria yielding a FAP of un- 
der 10% (prior probability that a background event passes all 
the selection criteria), while also accepting a wide range of as- 
trophysical models. The thresholds applied to the variability 
measure (slope index or flux variation) were designed to de- 
tect fading transients while leaving the possibility of detecting 
light curves showing flaring within short time-scales (hours). 
However, recent re-evaluations of kilonova emission by Barnes 
& Kasen (2013) and others have indicated that more realistic 
values for the opacities of the heavy radioactive elements lead 
to dimmer and broader light curves. These would be difficult to 
detect with the depth and cadence of our data set. 

4.1.2. Background Estimation 

The background was estimated by running the analysis over 
a series of images obtained from random time permutations 
of the real observation images. The first night observations 
were excluded from being selected as the first image in each 
permuted sequence to remove any astrophysical electromagnetic 
counterparts from the data set. The background simulation was 
repeated 100 times for TAROT and the Zadko Telescope and for 
all the permutations allowed by the observations for QUEST. 

Genuine optical transients would have lost their regularly fad- 
ing light curve in the scrambled image set. Random sequenc- 
ing thus erased them while artifacts such as CCD noise, pixel 
saturation, bad pixels, errors in the de-blending and source as- 
sociation, etc., were just as likely to pass the pipeline’s selec- 
tion cuts as with the true sequencing. This procedure allowed 
a measurement of the rate of false positives due to “technical” 
noise. However, this procedure did not permit a valuable esti- 
mate of the “astrophysical” background since the randomization 
reduced the number of identified astrophysical transients that ac- 
tually dimmed over time. A statistically significant estimate of 
the astrophysical background would require the study of survey 
data not associated with GW triggers, which was not available 
at this time. 

An example of the distribution of technical background events 
(after the removal of rapid transients) detected in the FOV of 
TAROT for trigger G 19377 is shown in Figure 1 . The cumulative 
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Figure 1. Background plots for TAROT data associated with trigger G19377 obtained by performing the on-source analysis (top plots) and whole-field analysis 
(bottom plots). In the left plots, N gives the cumulative number of technical background events found in a permuted set of images above the magnitude threshold shown 
on the X-axis, averaged over 100 permutations. The right plots show the FAP as a function of the slope index (in the case of whole-field analysis the requirement of an 
initial magnitude brighter than 14 was applied). 

(A color version of this figure is available in the online journal.) 
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distribution of their initial magnitude is shown in the left plot, 
and the FAP as a function of the slope index is in the central plot. 
The on-source analysis showed a greatly reduced background 
level compared to the whole-field analysis, since only objects 
near a local galaxy were included. In this example, the nominal 
slope index threshold of 0.5 reduced the FAP to less than 1% in 
the on-source analysis. For the whole-field analysis, in addition 
to the same cut on slope index, a requirement that objects showed 
an initial flux brighter than magnitude 14 was needed to reduce 
the FAP below the 10% objective. 

The “technical background” rate varied significantly between 
different instruments due to different fields of view, limiting 
magnitudes, image quality, and star crowding. For TAROT 
and Zadko, the number per square degree of “technical false 
positives” brighter than a reference magnitude of 14.5 mag for 
TAROT and 15.5 mag for Zadko was evaluated to be less than 
1 deg -2 using a slope index threshold of 0.5. For QUEST, the 
background study was performed CCD by CCD to account for 
the different density of false positives on each CCD. Compared 
to TAROT and Zadko, the deeper sensitivity observations of 
QUEST led to a higher number of false positives: an average 
value of 6 deg” 2 brighter than 18 mag and with magnitude 
variation larger than 0.5. Reducing the analysis to the on-source 
regions allowed us to lower the density of background transients 
to less than 1 deg” 2 . 

4.1.3. Analysis Tuning 

For TAROT and Zadko the two types of analysis were 
tuned to achieve 10% FAP using the on-source and whole-field 
backgrounds, respectively. The nominal slope index threshold 
(<0.5) resulted in the target FAP (< 10%) for half of the on- 
source analyses. For the other half, a threshold on the initial 
magnitude (in the range 12-13 mag) was also required. For the 
whole-field analyses, an initial magnitude threshold of 14 mag 


was demanded for the TAROT follow-up of G 19377 and a 
threshold of 10 mag for the Zadko follow-up of G19377, and 
the Zadko and TAROT follow-up of G20190. For these last 
three follow-ups the presence of observations taken months 
after the GW trigger allowed the additional requirement of the 
object’s presence in the early observations and its absence in the 
reference ones. 

For the QUEST on-source analysis, two methods were used to 
estimate the false positives. First, the background was evaluated 
directly in each on-source area. Due to the low statistics in these 
areas, a second estimate was also produced by rescaling the 
background event counts in the entire CCD to the on-source area. 
The target FAP (evaluated by both methods) was achieved for the 
majority of galaxies by demanding a magnitude variation larger 
than 0.5 between the first night and follow-up night observations, 
and an initial magnitude brighter than 17.5 for G20190, and 18.5 
for G23004. For eight galaxies associated with G23004, stronger 
thresholds on the initial magnitude (between 15 and 18.2) were 
required. 

Simulations have been performed for each set of images by 
using the exact thresholds applied for the analysis of the data 
associated with the GW trigger to prove the ability to detect 
likely EM counterparts (GRBs and kilonovae), and to evaluate 
the search sensitivity for the analysis procedure described above 
(see Section 6). 


4.2. ROTSE Search 

The ROTSE-II1 network consists of four robotic telescopes 
at various locations around the world. For each GW trigger 
in the autumn run, the telescopes repeatedly observed a single 
field. Each field was observed in a series of 30 exposures on 
the first night after the trigger time. Follow-up images were 
collected over the next 30 nights, with observations spaced an 
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average of every 2 nights. Each follow-up observation included 
8 exposures, each 20 or 60 s. 

We used the existing ROTSE pipeline to analyze the images 
taken with the network. Based on the ISIS package, 149 which 
uses a single convolution algorithm described in Alard & Lupton 
(1998) and Alard (2000), the ROTSE pipeline was adapted 
to use cross correlation to improve image subtraction results. 
The details of this method can be found in Yuan & Akerlof 
(2008). The pipeline was implemented for our analysis to require 
minimal user interaction and for large scale processing which 
enabled characterization of the background, as described in 
Nuttall etal. (2013). 

The pipeline began by stacking images from the same night 
on top of one another to form a coadded image. SExtractor 
was used to produce a list of objects and their coordinates for 
each coadded image. These images were then subtracted from 
the coadded reference image, and several criteria were imposed 
on any objects found in the subtracted image. Selection criteria 
included requiring a full width at half-maximum consistent with 
a point source, seeking a minimum fractional flux variation 
between images and a signal-to-noise ratio (S /N) greater than 
some amount. The specific criteria depended on the location 
of the source in an image. For example, if a source matched a 
star or an unknown object a flux change of 60% was required, 
whereas if a source was within 20% of the semi-major axis 
length from the center of a galaxy, but not consistent with a 
core, only a 3% flux change was required. The result was several 
lists of candidates (one from each night), which we combined 
to produce a single list of unique candidates which appeared in 
the images, and generated light curves for all candidates. 

The vast majority of these candidates were due to poor sub- 
traction, with a fraction of real but uninteresting transients (such 
as variable stars or asteroids). In order to remove contaminants 
from the list of candidate transients, each object was subjected 
to a series of cuts. In order to be of interest, the transient must 
have appeared on more than one night, shown a sufficiently 
decaying light curve 48 hr after the trigger, and not have been 
coincident with a known variable source (from the SIMBAD 
catalog 150 ) or with a minor planet (Minor Planet Checker 151 ). 
These cuts proved efficient at rejecting the majority of the back- 
ground. Candidates were then highlighted if they overlapped 
with known galaxies or if their light curves were consistent 
with a target theoretical light curve (Metzger et al. 2010; Kann 
et al. 2011, 2010). They were also assigned an ad hoc ranking 
statistic, R, defined as: 

R = ^(18 — 7«;)0(18 — nti) x Wj . (2) 

i 

Here ©(.*) is the step function, m,- is the background-subtracted 
magnitude of the transient in image i, and w, is a weight factor 
defined by 

1 t, - r GW < 1 day 

W ' ~ 1 (i + l°gio T^f) U - few > 1 day (3) 

where r G w is the time of the GW trigger, t\ is the time of image i. 
The ranking statistic was designed to prefer events which were 
bright within a day of the trigger time and which appear in 
multiple images. 

149 http ://w w w2 . iap.fr/users/alard/package . html 

150 http://simbad.u-strasbg.fr/simbad/ 

151 http://scully.cfa.harvard.edu/cgi-bin/checkmp.cgi 
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The ROTSE FAR was investigated by processing sets of 
images for each of 100 random field locations selected from 
the ROTSE archive. Each set contained ~240 images of the 
field from a month of nominally nightly observing. The FAP 
for each GW candidate was estimated by counting the number 
of transient objects visible in archived images with a similar 
cadence as the images collected for that GW candidate. The 
ranking statistic for each such transient object was calculated 
using Equation (2). These studies allowed us to set thresholds 
on the ranking statistic to keep the target light curves, while 
rejecting contaminants. 

4.3. Catalog-based Search for Pi of the Sky 

POTS has an unusually wide FOV of 20° x 20°, with a typical 
limiting magnitude of 1 1.5 for a 10 s exposure. This allowed 
the telescope to image a large part of the sky in response to one 
LIGO/Virgo trigger, over 40° x 40° on most nights. We used 
the standard POTS pipeline to analyze the images taken by the 
telescope. A detailed description may be found in Malek et al. 
(2009) and Sokolowski (2008). The full analysis was carried 
out in two steps. First, in each image taken by the telescope, the 
Guide Star Catalog (Jenkner et al. 1990) was used to identify 
previously unknown sources. Second, POTS’s nova recognition 
algorithm was applied to the list of unknown sources. To separate 
optical transients from contaminating sources, the algorithm 
utilized several types of vetoes, including checks on background 
saturation, nearby bright objects, satellite databases, and the 
Guided Star Catalog. Objects that passed the cuts were then 
visually inspected. 

During the human inspection stage, every candidate that 
was not identified as a satellite or background fluctuation was 
checked against lists of known sources. First, we queried the 
POTS, INTA (Spain) site for observations made in 2011. Due 
to the long time (~1 yr) between the autumn science run and 
observations from the INTA site, any objects observed by INTA 
were likely unrelated to the GW trigger. 159 Finally, objects were 
cross-correlated with the SIMBAD catalog, and sources that 
appeared nearer than 150" to the position of any known star or 
infrared source were rejected. 

4.4. SkyMapper Search 

SkyMapper obtained two epochs of an eight image mosaic 
covering a total of ~42 deg 2 in response to the 2010 Septem- 
ber 16 trigger. An image subtraction technique was applied 
to identify possible transients. The SkyMapper images were 
reduced via the normal bias subtraction, overscan correction 
and flat fielding using a custom made Python-based pipeline. 
Thereafter, frames from the two epochs were aligned with the 
WCSREMAP 153 routine and subtracted with HOTPANTS 154 to cre- 
ate residuals images. SExtractor was used to identify sources 
with S/N greater than three. Then, a series of cuts was applied 
to the SExtractor output parameters to identify noise and 
bad subtractions. These included using the ellipticity parameter, 
photometry from different size apertures, and catalog matching 
of variable stars. In addition, a study of the point-spread function 
(PSF) of each object was performed on the subtracted images 
by fitting the detection with a two-dimensional Gaussian and 
comparing the fit parameters to the expected, known, PSF. The 


152 All Pi of the Sky telescopes have the same cameras, so data gathered is 
easily comparable. 

1 5 3 http ://www. astro . Washington . edu/users/becker/v2 . 0/ c_software .html 

1 54 http ://www. astro . Washington . edu/users/becker/hotpants . html 
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remaining objects were then examined manually to verify they 
correspond to an object which was visible in the first epoch and 
not detectable/fainter in the second. The light curves were then 
measured using differential photometry with nearby stars. 

4.5. PTF Search 

The PTF accepted the trigger of 2010 September 26. Nine PTF 
fields, each covering 7.26 deg 2 , were schedule automatically for 
observations, and they were observed beginning ~6 hr after the 
trigger time (since the trigger occurred during day-time on the 
Pacific Coast). PTF then repeated the observations on several 
subsequent nights. The number of follow-up observations was 
mainly limited by full moon constraints. 

The imaged fields were searched for candidate transients us- 
ing the image subtraction pipeline hosted at LBNL (P. E. Nugent 
et al. 2014, in preparation; Gal- Yam et al. 2011). Only three 
of the fields imaged by PTF had previously constructed refer- 
ence images. For the rest of the fields, image subtraction was 
performed using a reference image constructed by co-adding 
several images taken during the first night of observations. Im- 
age differencing inherently produces a large number of spurious 
candidates, and only a small fraction (less than few percent) of 
these are real events. As described in Bloom et al. (2012), in a 
typical PTF night of order 10 5 residual sources are found per 
100-200 deg 2 of imaging, after performing subtraction of the 
reference image. 

To distinguish between astrophysical objects and “bogus” im- 
age subtraction residuals, we made use of a classification param- 
eter named the “realbogus” parameter (RB; Bloom et al. 2012), 
which was assigned by a machine-learned (ML) classifier so 
as to reasonably mimic the human scanning decision of real or 
bogus. The RB parameter ranged from 0 (definitely bogus) to 1 
(definitely real), and was constructed from 28 SExtractor out- 
put parameters, including magnitude, ellipticity of the source, 
and distance from the candidate to reference source. 

To maximize the chances of identifying a potential optical 
counterpart to G21852, the images collected by PTF were ana- 
lyzed using two different procedures for transient identification, 
both based on the RB parameter as a starting point (P. E. Nugent 
et al. 2014, in preparation). While the first procedure (hereafter, 
the “automated” approach) was largely based on automated ML 
techniques and optimized for fast transients, the second (here- 
after, the “citizen-based” approach) was largely based on a cit- 
izen project (Smith et al. 2011) and optimized for supernova 
searches. In what follows, we describe these two approaches in 
more detail. 

4.5.1. Automated Approach 

We identified the most promising fast transient candidates 
(i.e., transients with a variability on a timescale of a week or 
less) obtained in an image subtraction by applying the following 
selection criteria: 

1. RB 0.17 in at least one detection; 

2. matching of the candidate with at least one other detection 
with RB > 0.07; 

3. the second detection should be coincident with the candi- 
date position within 2" on the sky; 

4. the second detection should be at least 45 minutes (and no 
more than 6 days) before or after the original candidate. 

Candidates satisfying the above criteria were further passed 
through the so-called “Oarical classification routine” which, as 
part of the standard PTF operations, was designed to distinguish 
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between two main classes of events, namely “transients” and 
“variable stars.” The classifier used both time-domain features, 
such as light-curve evolution, and context features, including 
the location of the source relative to known stars and galaxies 
(see Bloom et al. 2012 for details). 

Candidates with high RB and high classification confidence 
were saved automatically in the so-called “PTF Marshal” web 
archive, and thus assigned an official “PTF name” and a tentative 
object type. Further spectroscopic follow-up was pursued only 
for sources that looked particularly promising in relation with 
the main science objectives of the PTF survey. 

The main challenge of our study was to identify, among the 
list of candidates retrieved using the criteria described here (and 
in the absence of spectral classification for most of them), the 
ones more likely to be of interest for LIGO and Virgo, in the 
sense of having properties consistent with “explosive” events 
such as binary mergers or stellar collapses, that our search was 
targeting. 

4.5.2. Citizen-based Approach 

In addition to the list of candidates described in the previous 
section, we also considered candidates passing selection criteria 
optimized for the identification of young supernovae: 

1. candidate RB parameter value >0.07; 

2. detected at least twice; 

3. flat or rising light curve; 

4. not seen prior to 10 days before the earliest day. 

As part of normal PTF operations during 2010, candidates 
passing the above criteria were further examined by citizen 
scientists through the Galaxy Zoo Supernovae project (Smith 
et al. 2011). The Galaxy Zoo scanners were presented with 
a series of detection “triplets” for each candidate. Each triplet 
contained three images: the current image of the field containing 
the candidate; the historical or reference image of the same field; 
and the image of the difference between the previous two (which 
should contain only the candidate light). Each examiner was 
asked a series of questions to determine if the candidate appeared 
consistent with a supernova, and the answers were converted 
into a score. The arithmetic mean of the scores from many 
scanners was calculated, and candidates with strong (supernova- 
like) scores were counted in our final list of candidates. 

4.5.3. Selection for LIGO /Virgo Event Candidates 

All of the candidates from both the automated approach and 
citizen-based approach were vetted by human scanners to judge 
which candidates deserved to be kept for further investigation as 
“LIGO/Virgo interesting.” To do so, we took advantage of two 
new parameters recently developed by the PTF team, to improve 
confidence in transient identification. The first parameter is the 
so-called “realbogus 2” (RB2; Brink et al. 2013). The RB2 
parameter is similar to the RB parameter, but it was defined 
by using a much larger training sample (78,000 objects). The 
RB2 also utilized some additional features that the original RB 
parameter did not use, including correlations in different PTF 
filters. By using a sample of spectroscopically confirmed sources 
discovered by PTF, it has been found that selecting candidates 
with RB2 >0.3 yields a false positive rate of ~3%, and a missed 
detection rate of »3.2% (Brink et al. 2013). 

The second parameter is known as the Supernova Zoo 
predictor, a ML classifier that was trained using the Supernova 
Zoo mark up of tens of thousands of candidate transients, so 
as to construct a classifier capable of efficiently discovering 
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supernovae. The Supernova Zoo predictor assigns a score 
(hereafter, SN Z00 ) to each of the candidates, which is higher for 
more promising candidates (i.e., the ones that are most likely 
to be real supernovae). By using a sample of spectroscopically 
confirmed supernovae discovered by PTF, it has been found that 
selecting candidates with SN zoo > 0.025 yields a false positive 
rate of «14%, and a missed detection rate of ~ 1 0%. 

For our final selection cuts, we applied the following criteria: 

1 . Was the transient classified spectroscopically as a variable 
star, an AGN, or a SN of type la? If yes, discard. 

2. Was the candidate detected for the first time before the GW 
trigger time? If yes, discard. 

3. Does the transient appear to have subtracted correctly? If 
not, discard after double checking that this is consistent with 
a low value of the RB2 ( RB2 < 0.3) and of the supernova 
zoo predictor parameter (SN ZOO < 0.025). 

4. Is the candidate classified as a STAR in Sloan Digital Sky 
Survey (SDSS), and/or is it spatially coincident with a 
known stellar or AGN source in SIMBAD? If yes, discard. 

5. If the analyzed field is not in the SDSS footprint and 
nothing is found in SIMBAD (see above), can the candidate 
be securely associated with a point-like host in the PTF 
reference image (or in an image taken a year after the 
LIGO/ Virgo trigger in case a previous reference image 
was not available)? If yes, is the Oarical classification (see 
Section 4.5) consistent with a “variable star” and/or is there 
enough photometry to confirm a long-term variable origin 
from the light curve? If yes, discard. 

6. If the analyzed field is not in the SDSS footprint, nothing 
is found in SIMBAD, and a point-like host cannot be 
identified in the reference image (see above), then: Does 
the candidate have both RB2 and SN zoo below threshold? 
Or, is it classified by the Oarical classifier (Section 4.5) as 
variable star or AGN, and is there enough photometry to 
confirm a long-term variable origin from the light curve? If 
yes, discard. 

4.6. Liverpool Telescope Search 

The Liverpool Telescope observed the G23004 trigger using 
both the 4.6 arcmin FOV RATCam instrument and the 1° FOV 
SkyCamZ camera. This produced a total of 22 SDSS r'-band 
RATCam images and 121 “clear” filter SkyCamZ images from 
two nights 29 days apart. In addition, 3 RATCam and 17 
SkyCamZ images were taken in early 2012 to serve as reference 
images for image subtraction. The analysis made use of several 
freely available software packages, and was split into several 
sections written in Python. 

First, we combined the images from 2012 to create our ref- 
erence images. This was done by aligning the images using the 
WCSRemap 177 package and combining them using the SWarp 156 
package. We also combined sets of five SkyCamZ images on 
each night to improve image quality and provide a similar ca- 
dence to the RATCam images. We removed one RATCam image 
and two SkyCamZ images due to quality issues. 

Second, as the SkyCamZ images used a non-standard filter, 1 '’ 7 
they were calibrated using the USNO-B catalog of stars to 
determine the zero point offset required to calculate correct 
magnitudes, in the same way ROTSE and TAROT images were 
calibrated (see Section 4.1). This was done by comparing the 
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USNO-B R-band magnitude of stars in the combined SkyCamZ 
fields with those same stars found using SExtractor. 

The images were then aligned individually to the reference 
images, again using WCSRemap, and the reference image was 
subtracted using the HOTPANTS 178 image subtraction package. 
SExtractor was then used to detect potential candidates in 
each individual field with a minimum of 4 pixels each with a flux 
greater than 4cr above the background noise of the image. This 
reduced the frequency of detecting uninteresting objects, such 
as cosmic rays, extremely faint stars and noise from the image 
subtraction process while allowing us to achieve a sensitivity 
around 20th magnitude in the narrow-field RATCam images. 

Using the output of SExtractor from each of the subtracted 
images, a Python script combined the objects found into a master 
list containing every unique candidate found in those images, 
along with useful parameters from SExtractor. From this data, 
a series of cuts were made to find candidates interesting to 
this analysis. First, candidates found to be near an image edge 
(or a bad pixel strip in the case of RATCam images) were 
rejected. Second, a cut was made to remove artifacts due to 
bad subtraction. This was achieved by examining the region in 
the subtracted image around the candidate and calculating the 
total flux more than 4 a below the median noise of the image. 
Since bad subtractions are usually caused by poor alignment or 
convolution, they typically produce a large amount of “negative” 
flux in the residual image. If the total amount of flux below 
this threshold was the equivalent required for detection of 
candidates (4 pixels above 4er) then the candidate was rejected. 
The next cut removed candidates not seen in at least half of 
the images available on the first night, to ensure candidates 
were visible long enough to be used in our analysis. We also 
rejected candidates that appeared close to known variable stars 
and minor planets. Finally, we required that a candidate must 
decrease in brightness by more than 5er of the median error 
on the magnitude measurements from SExtractor, from the 
first night to the second night 29 days later. Since the pipeline 
is designed to work with images from two telescopes for this 
analysis which may have different magnitude errors for the 
same trigger, we used a threshold based on the noise in the 
image rather than a fixed magnitude variation in the same way 
as ROTSE and TAROT. 

Any objects that remained after these cuts were considered 
likely candidates, and looked at in more detail. This was done 
by plotting the light curves of each object across both nights 
and inspecting images of the candidates in both the original 
and subtracted images. This allowed us to gauge whether any 
transients warranted further investigation. 

5. OPTICAL TRANSIENT SEARCH RESULTS 

In this section we present the details of the associated optical 
images for each GW trigger. The center location of each 
observed field is shown in Table 5. We also present the results 
of the transient analysis for each data set. Data from the two 
periods of our search were handled differently. The winter run 
triggers were not observed with sufficient cadence to reconstruct 
light curves, so only a limited analysis was performed on those 
triggers. Section 5.1 describes the results of the analysis along 
with figures showing the position reconstruction and image 
locations for each winter run GW trigger (Figures 2 and 3). 


155 http://www.astro. Washington. edu/users/becker/v2.0/c_software.html 

1 56 http ://w w w.as tromatic .net/ software/s warp 

157 http://telescope.livjm.ac.uk/Info/TelInst/Inst/SkyCam/ 
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Figure 2. GW skymaps for triggers G3821 (left) and CWB 1 (right). The colored regions show the estimated probability per square degree that each location is the true 
source direction before applying the galaxy weighting. The locations of the observed fields (selected using galaxy weighting) for telescopes that observed the trigger 
are also marked. 


(A color version of this figure is available in the online journal.) 


Table 5 

Center Locations of All Fields Observed 


GW Trigger 

Telescope 

R.A. 

Decl. 

R.A. 

Decl. 

R.A. 

Decl. 


-5.24 


227.61 


-64.26 


G3821 
CWB1 
G4202 
CWB 2 
G19377 

QUEST 
TAROT 
QUEST 
QUEST 
ROTSE-c 
Sky Mapper 


TAROT 

Zadko 

G20190 

ROTSE-abcd 

TAROT 

Zadko 

G21852 

QUEST 

ROTSE-b 

PTF 

G23004 

ROTSE-bcd 
Liverpool 
Pi of the Sky 


104.89 

-27.94 

133.88 

207.21 

-48.80 


89.34 

-0.70 

86.33 

81.00 

-32.49 

75.63 

115.56 

-30.00 


115.43 

-30.03 

120.01 

115.40 

-34.00 

115.39 

110.58 

-33.91 

120.22 

115.40 

-30.00 


110.98 

-27.53 

114.75 

115.80 

-29.98 

115.85 

333.25 

18.03 


333.33 

18.00 


322.49 

12.17 

323.37 

330.17 

17.74 

333.96 

336.29 

8.50 

334.49 

11.04 

41.61 


11.39 

41.62 

55.80 

56.93 

-21.37 

39.42 

55.24 

-16.87 

51.15 

61.97 

-20.91 


61.11 

-2.20 


Various 




-9.78 

89.34 

-5.24 

-50.65 

91.23 

-41.57 

-29.91 

110.78 

-29.92 

-25.99 

1 10.94 

-25.93 

-33.90 



-22.05 

115.25 

-32.07 

-29.22 




-0.82 

19.23 

329.77 

18.18 

10.63 

331.61 

17.57 

-19.12 

52.20 

-19.12 

-7.87 

52.25 

-28.12 

-25.87 

34.38 

-32.62 


Note. All coordinates are in degrees using the J2000 equinox. 


The methods described in Section 4 were applied to the data 
collected in response to each GW trigger in the autumn run. 
To display the sky coverage and depth of each response, two 
panels are presented for each autumn run trigger (Figures 4-7). 
The left panel shows the GW skymap (without the use of 
galaxy weighting) along with the positions and approximate 
field sizes of each observed tile. The right panel shows a 
timeline of the observations by each observatory. The y-axes 
of the timeline plots display the limiting magnitudes of the 
observations. In these plots, for TAROT, POTS, and Zadko, 
each arrow represents one observation. For QUEST, each arrow 
is a median of limiting magnitudes over the CCDs covering the 


on-source galaxy regions observed at a given time. For PTF, the 
arrows give the limiting magnitudes reached in a central CCD 
of the camera, for the field containing the M31 galaxy (which 
was observed with highest cadence). 

The right panel of each figure also shows several models 
for possible EM counterparts. The off-axis long GRB model 
(L-GRB; solid dark green line) is from van Eerten et al. (2010), 
and assumes a total energy in the jets of 2 x 10 51 erg, jet half 
opening angle of 0.2 rad, off-axis observer’s angle of 0.3 rad, 
interstellar medium number density of 1 cm -3 , and distance 
of 30 Mpc. We note that within this model, the associated 
optical transient peaks at ^1 day since trigger. The off-axis 
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Figure 3. GW skymaps for triggers G4202 (left) and CWB2 (right). See Figure 2 caption for explanation. 
(A color version of this figure is available in the online journal.) 
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Figure 4. On the left, the GW skymap for G19377, which was later revealed to be a blind injection. The skymap shows the probability per square degree that each 
location is the true source direction before applying the galaxy weighting. The locations of the observed fields (selected using galaxy weighting) for telescopes that 
observed the trigger are also marked. On the right, a timeline showing when each telescope observed the requested fields, with time zero corresponding to the GW 
trigger time. Model light curves for several sources, scaled to 30 Mpc, are shown for comparison (see Section 5 for details). 

(A color version of this figure is available in the online journal.) 


low-luminosity GRB model (LL-GRB; dash-dot-dot-dotted 
dark green line) is from van Eerten & MacFadyen (201 1), and 
assumes a total energy in the jets of 10 50 erg, jet half opening 
angle of 0.2 rad, off-axis observer’s angle of 0.4 rad, and in- 
terstellar medium density of 1 cm -3 . The off-axis short GRB 
model (S-GRB; dashed dark green line) also refers to a total 
energy in the jets of 10 50 erg (and similar jet and observer’s 
angles), but the interstellar medium density is set to 10“ 3 cm“ 3 . 
The light green line represents the case of a faint short GRB 
observed on-axis (see Table 1 and Kann et al. 2010, 201 1). The 
emission from typical short GRBs and long GRBs observed 
on-axis lies above this line. In particular, on-axis long GRBs at 
30 Mpc would appear as very bright optical transients. 

The kilonova models are courtesy of Barnes & Kasen (dashed 
dark blue), B. Metzger (dark blue), and E. Nakar (light blue). 
Specifically, the light blue line represents one of the kilonova 
bolometric light curves from Piran et al. (2013) (BH-NS merger 
with BH mass of 10 M 0 ). This light curve assumes that all 


of the bolometric luminosity is emitted in the R-band, and it 
represents an upper-limit to the true R-band luminosity of the 
kilonova event. The solid dark blue line is one of the kilonova 
light curves from Metzger et al. (2010), and is calculated for an 
ejecta mass 10“ 2 M 0 assuming a blackbody emission. Finally, 
the dashed dark blue line is one of the kilonova models from 
Barnes & Kasen (2013), for the case of low-velocity (0.1c) 
low-mass (M = 1 0 3 M 0 ) ejecta. Since the kilonova models 
are subject to large uncertainties, we selected these three light 
curves to give an indication of the possible scatter in the model 
predictions. 

Finally, the prototype emission from a GRB-associated SN 
is plotted with a red dotted line: this is a tentative extrapo- 
lation to early times of the R-band light curve observed for 
SN 1998bw (red asterisks; Clocchiatti et al. 2011), associated 
with GRB 980425 (Galama et al. 1998). The light curve as- 
sumes that SN 1998bw exploded at the same time at which 
GRB 980425 was triggered. 
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Figure 5. GW skymap and observations of trigger G20190. See Figure 4 caption for explanation. 
(A color version of this figure is available in the online journal.) 
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Figure 6. GW skymap and observations of trigger G21852. See Figure 4 caption for explanation. 
(A color version of this figure is available in the online journal.) 
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Figure 7. GW skymap and observations of trigger G23004. See Figure 4 caption for explanation. The shown Pi of the Sky (POTS) fields are a subset of the 10 
overlapping pointing positions used to observe the GW uncertainty region. 

(A color version of this figure is available in the online journal.) 
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5.1. Winter Run Triggers 

For each winter run trigger, images were collected only 
during one night. The absence of a second night’s observations 
prevented the construction of variability measures and limited 
the analyses to only identify “unknown objects,” i.e., those not 
listed in the USNO catalog or with a magnitude significantly 
different from USNO, but visible in all the collected images. 
For both the TAROT and QUEST image analysis procedure, at 
least one observation on another night would have been required 
to identify a unique electromagnetic counterpart. 

In the winter run, TAROT responded to one trigger, CWB 1 , 
and collected six images starting the single night observation 
at T+3dllh. The QUEST camera responded to three triggers, 
G3821, G4202, CWB2, starting the observations at T+9h46m, 
T+24m, and T+16hl2m, respectively. For each trigger it col- 
lected images corresponding to three fields. Each field was ob- 
served twice within 20 minutes during the same night. 

The TAROT observation associated with CWB1 reached a 
sensitivity of 15.8 mag. Fifteen galaxies with a distance smaller 
than 50 Mpc were in the FOV. The analysis found 9 unknown 
objects in the on-source region and 46 in the entire FOV up 
to the limiting magnitude. No unknown objects were found 
with magnitude brighter than 11.8 in the on-source region and 
brighter than 10.7 mag in the entire FOV. 

The three QUEST fields associated with G3821 included a 
total of 34 galaxies with a distance smaller than 50 Mpc. Only 
14 of the galaxies were analyzed due to the exclusion of galaxies 
observed only one time or lying in CCDs that did not work or 
had calibration problems. The average limiting magnitude was 
about 18.6 mag. 

For trigger G4202 the three fields included a total of 17 
galaxies with a distance smaller than 50 Mpc. Ten galaxies 
were removed from the analysis because they were observed 
only one time or associated with poor image quality (impacted 
by bad lines and pixels or by background subtraction artifacts) 
or calibration problems of the CCDs (astrometric calibration or 
flat-field problems). An average limiting magnitude of 19.2 mag 
was reached during the observations. 

For trigger CWB2 the three fields included a total of 12 
galaxies with a distance smaller than 50 Mpc. Two of the 
galaxies were not analyzed due to poor image quality or 
CCD calibration problems. An average limiting magnitude of 
19.8 mag was reached during the observations. 

The QUEST analysis found 9, 1 and 1 unknown objects in the 
on-source region with a magnitude brighter than 14 mag for the 
triggers G3821, G4202 and CWB2, respectively. The number 
of unknown objects increased to 140, 35, 6 for magnitudes 
brighter than 18 mag. The number of unknown objects showed 
a stronger dependence on the density of image artifacts and stars 
in the FOV than on the on-source area. No “unknown objects” 
were found for magnitude brighter than 9, 12, 7 mag for G3821, 
G4202 and CWB2, respectively. 

5.2. G19377 

Event G 19377 was a simulated signal added to the GW 
detector data in order to test our data analysis pipelines. The 
ROTSE-IIIc telescope responded at T+~ 1 2 hr when 30, 20 s 
exposure images were taken within ~ 15 minutes. On subsequent 
follow-up nights (6-29) both ROTSE-IIIa and c telescopes 
gathered 80, 20 s exposure images. The images from the two 
scopes varied vastly in terms of image quality, which posed 
difficulties for injection studies. We discarded the lower quality 
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images from the 3c telescope, leaving just the 3a images, with an 
average limiting magnitude of 15.1. Two galaxies at ~24 Mpc 
(PGC 078144 and PGC 078133) were visible within the FOV. 
The ROTSE image processing pipeline revealed 68 unique 
objects, one of which passed the candidate validation. Further 
tests found this candidate was consistent with background, with 
a FAP of 7%. This left no significant candidates. At the location 
of this background transient there is a known star (red magnitude 
of 13.1 from the USNO catalog), which shows no significant 
magnitude variation in the TAROT images associated with the 
same GW trigger. This location was not covered by the Swift 
observations taken for G19377. We also tried analyzing images 
from both the 3a and 3c telescopes together, and found no 
additional candidates. 

SkyMapper observed an eight tile mosaic, 7 days after 
the initial alert. An analysis was performed, but no plausible 
transients were discovered. 

TAROT took images starting at T+43m and repeated the 
observations at T+2d, T+3d and T+4d. Observations from the 
four nights displayed an average limiting magnitude of 15.1. 
The on-source analysis was performed on the two same galaxies 
observed by ROTSE and identified no transient counterpart. The 
whole-field analysis was performed with an initial magnitude 
threshold of 14 mag, and identified one transient candidate with 
a slope index of 0.6. A deeper analysis showed that this candidate 
resulted from an artifact of the de-blending in crowded images. 

The Zadko telescope observed the regions around the five 
galaxies evaluated to be the most likely hosts of the G 19377 
trigger: NGC 2380, ESO 560-004, ESO 429-012, PGC 078133, 
and PGC 078144; the last two being in common with ROTSE 
and TAROT. The observations started at T+ldl2.6h and were 
repeated 5 months later for reference. The average limiting 
magnitude for both the early and reference images was 16.5 mag. 
No electromagnetic counterparts were identified by either the 
on-source on whole-field analysis. 

5.3. G20190 

All four ROTSE-III telescopes responded to this GW trig- 
ger, taking images spanning T+34h38m to T+29d, centered on 
the region around the galaxy UGC 1 1944. However, all images 
taken with the ROTSE-IIIa, b and d telescopes were discarded 
because of defocusing factors in addition to weather conditions 
at those sites being less than optimal. This resulted in 56 images 
being used for the analysis, with an average limiting magnitude 
of 15.5. The ROTSE image subtraction pipeline found 77 poten- 
tial candidates, none of which passed the candidate validation 
procedure. 

The TAROT telescope collected three images in association 
with G20190. Due to the full moon only an average limiting 
magnitude of 14.6 mag was reached. Nine months later 18 
images were taken by TAROT in the same region of the sky as 
reference. A mean limiting magnitude of 17 mag was reached 
during this second observation. No counterpart with a FAP less 
than 10% was identified by the on-source analysis. The whole- 
field analysis was performed with a threshold of 10 mag on 
the initial magnitude and the required presence in the first three 
images and absence in the reference images. It resulted in four 
identified candidates. The candidates were seen to be image 
artifacts linked to the spikes of saturated stars. 

The Zadko telescope was pointed toward two Galactic globu- 
lar clusters: NGC 7078 and NGC 7089, and three galaxies UGC 
11868, NGC 7177, and NGC 7241, evaluated to be the most 
likely hosts of the GW source. Observations of galaxies UGC 
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11868 and NGC 7241 were taken about 50 minutes after the 
GW trigger. All five fields were observed subsequently during 
at least two nights between T+ld and T+4d. The observations 
were repeated 1 1 months later for reference. The average limit- 
ing magnitudes were 16.4 mag and 17.3 mag for the very first 
and reference observations, respectively. The on-source analysis 
identified three transient candidates associated with NGC 7078 
and 15 associated with the center of NGC 7089. The candidates 
were found to be due to problematic de-blending in the cen- 
tral region of globular clusters. No transient was identified by 
the on-source analysis associated with the three galaxies. The 
whole-field analysis required a magnitude brighter than 10 and 
the presence during the first nights and absence in the reference 
images. This resulted in no detected transient. 

The QUEST observations started at T+12h3m. Each field was 
observed twice within 15 minutes as pairs of images dithered 
to fill the gaps between rows of CCDs. The entire observation 
sequence was repeated at T+1.5d. A total of 10 galaxies with a 
distance smaller than 30 Mpc were identified in the three fields. 
Three of the galaxies were not analyzed due to poor image 
quality CCDs or calibration problems. The observation was 
taken during a full moon night that allowed an average limiting 
magnitude of 17.6 mag. The on-source analysis 159 identified 
one possible transient with a FAP less than 10% (see Section 6) 
associated with the galaxy UGC 11916. A deeper analysis of 
the candidate showed this to be artificial. The analysis pipeline 
identified the possible GW host galaxy itself as a transient due to 
variations in the estimate of its surface photometry over the two 
nights. An estimate using fixed photometry apertures indicated 
magnitudes in agreement within the errors with no flux decrease. 

5.4. G21852 

ROTSE-IIIb took images spanning T+llh53m to T+29d 
centered on a region containing both M31 and M110. One 
follow-up night had to be ignored due to defocusing issues. 
The average limiting magnitude of the images was 16.6, with 
81% of them having an exposure times of 60s. The subtraction 
pipeline found 187 objects, which resulted in four candidates 
after candidate validation. All four candidates overlapped with 
one of the galaxies mentioned, however all were consistent with 
background. The highest ranked candidate had a FAP of 9%. 
Consequently, we found no significant candidates. Within the 
2 arcsec positional accuracy of PTF, the ROTSE background 
events are all coincident with known stars, and according to the 
PTF analysis criteria applied, these sources are not considered 
candidates. 

PTF observed nine different fields on five nights, beginning 
at T+6h37m. The median limiting magnitude reached in the 
observed fields over the observation time (and over the 11 
CCDs that make the core of the PTF imager) was in the range 
R ~ 20.2-19.2. The images collected by PTF were analyzed 
using two different procedures for transient identification, one 
entirely based on automated selection criteria for fast transients, 
and the other largely based on a citizen project targeting su- 
pernovae (see Section 4.5 for more details). These procedures 
for transient identification were routinely used by the PTF sur- 
vey (P. E. Nugent et al. 2014, in preparation). By applying the 
selection criteria for fast transients (automated approach; see 
Section 4.5.1) on the images that were taken for follow-up of 
trigger G21852, we obtained a list of 172 candidates, none of 


159 The 7% of the total on-source area within the gaps between the CCDs 
does not have data and was not analyzed. 
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which passed the vetting for “LIGO/Virgo interesting” tran- 
sients performed according to the criteria described in Sec- 
tion 4.5.3. We also applied these last criteria to the candidates 
obtained via the citizen-based approach (optimized for super- 
nova searches — see Section 4.5.2). Of the 218 candidates se- 
lected according to criteria ( 1)— (4) in Section 4.5.2 and sent out 
to the citizens for scanning, 28 were saved by the citizens and 
assigned an official PTF name. However, none of these 28 can- 
didates passed the additional vetting described in Section 4.5.3. 
We also took a closer look at 55 other candidates that were 
not saved by the citizens, but that had a SN zoo predictor score 
>0.025 or a RB2 > 0.3 (see Section 4.5.3). We vetted these 
candidates according to criteria (l)-(5) in Section 4.5.3, and 
none of them passed our screening. 

5.5. G23004 

The ROTSE-IIIb, c and d telescopes responded to G23004 
at T+6h25m and collected data up to T+29d. These images 
contained one galaxy (NGC 1518) at 1 1.5 Mpc within the FOV. 
Around 75% of the data was of poor quality; many of the images 
were out of focus and cloud cover was also a factor. This resulted 
in the analysis of 30 images with an average limiting magnitude 
of 16.7. The ROTSE subtraction pipeline found 124 potential 
candidates of which none survived the candidate validation tests. 

The Liverpool Telescope observed a single field centered 
on the location of the galaxy NGC 1507, with one hour of 
observations taken at T+9h and a further one hour at T+30d. 
The limiting magnitude of the RATCam images was r' ~ 20.5, 
averaged over all images, with the calibrated limiting magnitude 
of the SkyCamZ images averaging R ~ 17.5. We found 406 
unique objects in the RATCam images and 163 unique objects in 
the SkyCamZ images. After applying cuts described in Section 4 
we found no candidates in either the RATCam or SkyCamZ 
images that met our criteria. 

The POTS telescope responded at T+6h56m after the alert. On 
the first night the telescope used 10 different pointing locations 
to cover an area containing 40% of the G23004 probability map. 
Each location was imaged twice. The limiting magnitudes for 
the first night’s observations spanned 10.5-11.0 mag. On the 
first night there were over 700 cases that were recognized by 
the pipeline as possible optical transients, but all of them were 
either already included in the database of weak stars or were 
noise due to ice crystals on the camera. There were no real 
optical transients found. The same fields were followed up on 
the nights of October 5, 6, 7, 11, and 30. Each follow-up night’s 
observed area was covered by nine pointing locations, with each 
location imaged at least three times. Images from the first four 
nights were searched by the pipeline for optical transients, and 
40 objects were identified as existing in images over multiple 
nights and have been present on all frames that were taken of 
that field. Each of these was manually investigated, and none 
were found to be linked to the GW trigger. Most of the 40 objects 
were traced to variable stars or were caused by ice crystals on 
the camera. 

The QUEST follow-up for this GW trigger consisted of three 
nights of observations over three different fields. The first obser- 
vation began at T+l lh32m and then observations were repeated 
at T+2.4d and T+32.4d. Each night’s observations included two 
visits to each of two dithered positions for each of the three 
field locations. A total of 32 galaxies with a distance smaller 
than 50 Mpc were identified in the three fields. Due to inop- 
erative CCDs or CCD calibration problems the regions occu- 
pied by four galaxies were not analyzed. The average limiting 
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Figure 8. Efficiency in recovering simulated optical transients in the TAROT data (left) and Zadko data (right). The figure reflects the success rate in recovering 
transients added to the observed fields, and does not include efficiency lost due to observing only a fraction of the possible source locations. The signals have been 
simulated based on the models shown in Table 1 , with the power law flux of each GRB randomly scaled within the shown range of normalization conditions. 

(A color version of this figure is available in the online journal.) 


magnitude for the three night observations was 19.7 mag. The 
on-source 160 analysis identified one possible transient with an 
“on source” FAP of less than 10% (see Section 6). The candidate 
transient overlaid the extended emission of the galaxy IC0402. 
A deeper analysis indicated no flux change for the object: the 
point source immersed in the fainter galaxy edge emission has 
a similar neighboring object that biased its photometry. Using a 
suitable fixed photometry aperture the magnitudes of the object 
agree within the errors in all the images. The object could be a 
foreground star not listed in the USNO catalog or a bright knot 
of one of the galaxy’s arms. 

6. EFFICIENCIES FOR RECOVERING SIMULATED 
OPTICAL TRANSIENTS 

Simulated transients were added to each set of images 
to measure the efficiency in recovering optical counterparts 
located at different distances from earth. The different telescope 
pipelines were run over the simulated data with the same analysis 
tuning used in the real data. For TAROT, Zadko, QUEST, 
ROTSE, and the Liverpool Telescope the simulated transients 
reproduced the observed light curves (see, e.g.. Figures 5 and 
4 of Kann et al. 2011, 2010) of on-axis GRB afterglows and a 
modeled light curve for the kilonovae (Metzger et al. 2010; Piran 
et al. 2013). Table 1 summarizes the features of injected models. 
These models were scaled on the basis of the observation time 
from the GW trigger and the source distance. We emphasize 
here that while the simulated GRB afterglows cover the range 
of observed luminosities, kilonovae have not been observed 
yet and so our efficiency results are dependent on the assumed 
model. 

6.1. TAROT and Zadko Telescope 

For each set of images collected by TAROT and the Zadko 
telescopes, 100 simulated transients were added to the data for 
each counterpart model and distance. To model PSF variations 
in the wide-field images, reference model stars were identified 
in each image, and the PSF of the reference star closest to the 


160 The 10% of the total on-source area within the gaps between the CCDs 
does not have data and was not analyzed. 


injection position was used for each simulated object. For the 
GRB afterglows, we used a range of magnitudes uniformly dis- 
tributed between the brightest and faintest GRBs (see normal- 
ization in Table 1). The results are presented in Figure 8. Long 
GRB afterglows/short GRB afterglows/kilonovae were recov- 
ered with 50% efficiency in TAROT observations to distances 
of 400 Mpc/18 Mpc/6.5 Mpc respectively for trigger G19377 
and 355 Mpc/16 Mpc/13 Mpc for trigger G20190. For Zadko 
Telescope observations, we obtained 195 Mpc/8 Mpc/4 Mpc 
for G 19377, and 505 Mpc/ 25 Mpc 13 Mpc for G20190. As 
expected, the results showed some dependence on the depth of 
the observations, the observation time after the GW trigger, and 
the density of stars in the field. 

6.2. QUEST 

The QUEST pipeline’s recovery efficiency was evaluated sep- 
arately for each on-source galaxy region. As for TAROT and 
Zadko, 100 simulated transients were added to the images for 
each model (kilonova, short and long GRBs) and distance. Ran- 
domly distributed magnitudes between the brightest and faintest 
GRBs (see normalization in Table 1) were used. Figures 9-11 
show some representative examples of the achieved recovery 
efficiencies. The wide range in the recovery efficiencies reflects 
variations in CCD sensitivity and rates of contaminating arti- 
facts. In addition, bright galaxy extended emission prevented 
the recovery of some injections, even at close distances. A sim- 
ilar efficiency loss was found when a large part of the on-source 
region was occupied by foreground stars or image problems 
like bad pixels and bad lines. The results for the QUEST ob- 
servations can be characterized by the mean and the standard 
deviation of the distances corresponding to 50% efficiency to re- 
cover injections. For trigger G20190, we found mean distances 
of 33 Mpc (<t — 1 Mpc) for kilonovae, 30 Mpc (er = 6 Mpc) for 
short GRBs, and 820 Mpc (er ~ 180 Mpc) for long GRBs. For 
G23004, a mean distance of 64 Mpc (er = 25 Mpc) for kilono- 
vae, 63 Mpc (er = 30 Mpc) for short GRBs, and 1530 Mpc (er ~ 
700 Mpc) for long GRBs were found. 161 The larger spreads for 


161 Taking into account the galaxy regions lying in the CCD gaps, the 50% 
efficiency distances for G20190 (G23004) reduce to 32 (61) Mpc for 
kilonovae, 26 (53) Mpc for short GRBs, and 700 (1260) Mpc for long GRBs. 
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Figure 9. Some representative success rates in recovering simulated kilonovae lightcurves with the QUEST data for triggers G20190 (left) and G23004 (right). Each 
curve represents the efficiency from individual on-source galaxy regions, and so does not include efficiency lost due to observing only a fraction of the possible source 
locations. 

(A color version of this figure is available in the online journal.) 
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Figure 10. Some representative success rates in recovering simulated short GRB afterglow light curves with the QUEST data for triggers G20190 (left) and G23004 
(right). Each curve represents the results from individual on-source galaxy regions, and so does not include efficiency lost due to observing only a fraction of the 
possible source locations. Each simulated afterglow lightcurve was randomly scaled within the range of normalization conditions showed in Table 1. 

(A color version of this figure is available in the online journal.) 



QUEST reflect CCD-to-CCD variations. For both GW triggers, 
the 50% efficiency distances for long GRB afterglows were 
well beyond the maximum distance that the L1GO and Virgo 
detectors could have detected signals coming from NS binary 
coalescences, while the kilonova and short GRB distances were 
comparable. However, the result obtained for the kilonova tran- 
sients is dependent on the adopted model and relies on the fact 
that the QUEST observations were made around the peak time 
of the light curve model used for this study. 

6.3. ROTSE 

For each set of images collected by ROTSE, 140 simulated 
transients were added to the data for each counterpart model 
for 10 different distances. The PSFs for the injected transients 
were modeled on “good” objects PSFs within each image, as 


described in White et al. (2012). The GRB models used the 
brightest normalizations shown in Table 1; i.e., assuming mag- 
nitude 16 (23) at 1 day from z = 1 for LGRB (SGRB) after- 
glows. The results are presented in Figure 13. For each GW 
trigger, the efficiencies for the different counterpart models are 
very similar as functions of the injection magnitude. The effi- 
ciencies peak at ~70%-80% for triggers G 19377 and G20190, 
and at ~55% for G21852. Trigger G23004 (not shown) con- 
tained images of very poor quality and the injection efficiency 
only reached a maximum of ~20%. Long GRB afterglows/ 
short GRB afterglows/kilonovae were recovered with 50% de- 
tection efficiency to distances of 400 Mpc/ 16 Mpc/2 Mpc for 
trigger G 19377, 1000 Mpc/40 Mpc/5 Mpc for trigger G20190, 
and 1000 Mpc/90 Mpc/5 Mpc for trigger G21852. The maxi- 
mum sensitive distances correspond to transient magnitudes of 
approximately 15 on the second night. This was typical of the 
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Figure 11. Some representative success rates recovering simulated long GRB light curves with the QUEST data for triggers G20190 (left) and G23004 (right). Each 
curve shows the results from individual on-source galaxy regions, and so does not include efficiency lost due to observing only a fraction of the possible source 
locations. Each simulated afterglow lightcurve was randomly scaled within the range of normalization conditions showed in Table 1. 

(A color version of this figure is available in the online journal.) 



Figure 12. Distribution of ROTSE background (time-shifted) triggers and recovered injections for event G 19377. This plot shows the distribution ranking statistic for 
kilonova injections simulated from 1 Mpc, short GRBs from 7.9 Mpc, and long GRBs from 200 Mpc. The GRB models correspond to the brightest observed GRB 
afterglows. 

(A color version of this figure is available in the online journal.) 


average limiting magnitude of ROTSE over the FOV. Since the 
pipeline required transients to be seen on at least two nights, 
the magnitude on the second night was the primary factor deter- 
mining the sensitivity to each model. Transients at much smaller 
distances tended to suffer from saturation and were discarded in 
the image subtraction. The maximum detection efficiency was 
less than 100% because the pipeline was not always able to 
produce the background- subtracted lightcurve for a transient; 
this depended on the position in the image and on the image 
quality, as 16 reference stars were needed in the region around 
the transient for accurate image subtraction. Variations in effi- 
ciency between triggers were due mainly to differences in image 
quality and also differences in CCD performance between the 
different telescopes in the ROTSE network. 

An example of the distribution of injections against the 
background can be seen in Figure 12. This figure shows that of all 
the injections that produced a nonzero ranking statistic with the 
specific distance scales shown, more than 60% of the injections 
were recovered with a rank comparable to the most highly 
ranked background event. However, none of the injections were 


found with a ranking statistic higher than loudest background 
event. As the injection distances increased, the injections fell 
more and more within the background. 


6.4. Liverpool Telescope 

The efficiency of the Liverpool Telescope pipeline was 
measured with the same methods used for ROTSE. A Python 
script was written to inject 100 transient objects per 10 Mpc 
bin per model, with light curves following the three models 
described in Table 1 , assuming the brightest normalization for 
the GRB models. These images were then analyzed using the 
pipeline, and a script used to find and flag injections found in the 
pipeline output. Figure 14 shows that we obtained efficiencies 
around 90% for injections brighter than the limiting magnitude, 
including saturated objects normally discarded in other image 
subtraction methods. For RATCam, any of the tested models 
would have been observable out to 100 Mpc or more — well 
beyond the initial LIGO/Virgo horizon distance forNS mergers. 
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Figure 13. Efficiency of the ROTSE pipeline in recovering simulated kilonovae 
transients (left, solid), short GRBs (middle, dash-dotted), and long GRBs (right, 
dashed). The figure reflects the success rate in recovering transients added 
to the observed fields, and does not include efficiency lost due to observing 
only a fraction of the possible source locations. The efficiencies shown for the 
GRB afterglow models are based on the brightest models shown in Table 1 . At 
very close distances, the simulated objects became so bright that they caused 
saturations in the data, and were missed by the pipeline. The images associated 
with trigger G23004 were of poor quality, so the efficiencies with this data are 
not shown. 

(A color version of this figure is available in the online journal.) 

For SkyCamZ, we found similar efficiencies, over smaller 
distance ranges. 

6.5. Pi of the Sky 

The efficiency of the POTS transient search was investigated 
by adding simulated stars to existing images and reprocessing 
them. The objects that were injected had different magnitudes 
and were chosen from real observed stars during the autumn 
science run. Unlike the other simulations described in this paper, 
objects added to POTS data did not follow model light curves, 
but instead measured the ability of the pipeline to recover a 
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transient of a given magnitude using data from a single night. 
Stars injected in one image were also injected in subsequent 
images of that field taken during the same night. Only injections 
that were made to the inner part of the CCD chip, at least 150 
pixels from CCD borders, were considered to estimate transient 
detection efficiency. The border part of the CCD was rejected 
by the off-line optical transient recognition algorithm due to 
the possibility of CCD anomalies that might be mistaken as 
short optical transients. Also, only injections starting on a good 
quality image were considered in efficiency estimation. This 
means that the effective FOV for optical transient recognition 
corresponds to 15° x 15°. At each stage of the processing it was 
determined how many of the injected objects were detected. 

Figure 15 shows two curves demonstrating the efficiency of 
the POTS pipeline. The first one describes how many of the 
injected objects were detected in at least one image and the 
second curve shows how many of the injections were detected 
in five or more images. The first case corresponds to the minimal 
criterion that was required for the candidate to be classified as 
an optical transient and be inspected by a human. The second 
case reflects the criteria used for an optical transient to have 
been automatically classified as a nearly certain real event. 
On both curves we see that the maximal efficiency did not 
reach near 100%, even for very bright sources. This can be 
attributed to several causes. An important loss of efficiency 
came from areas excluded from the search due to the presence 
of previously discovered stars. Objects injected within a radius 
of 150" of stars listed in the POTS star catalogue were not 
recognized as optical transients and discarded by the pipeline, 
resulting in a 12%-15% impact to the injection recovery rate. 
Additional sources were lost to structure in the CCD: 10%-15% 
of the CCD area consisted of wire guiding electric charge. A 
significant part of the losses also came from quality checks in 
the algorithm preprocessing. At this stage transients that were 
fainter than 1 1th magnitude, or observed on multiple low quality 
frames, were discarded. This impacted the efficiency by 10% 
for bright transients, and up to 30% for faint transients injected 
with brightness around magnitude 11. Other cuts in the data 
processing pipeline resulted in an additional 3%— 10% loss of 
efficiency. 



Figure 14. Success rates recovering simulated short GRB afterglows, long GRB afterglows and kilonova light curves for the Liverpool Telescope, using the RATCam 
(left) and SkyCamZ (right). The figure reflects the success rate in recovering transients added to the observed fields, and does not include efficiency lost due to 
observing only a fraction of the possible source locations. The shown results for GRB afterglows are based on the brightest models that we considered. 

(A color version of this figure is available in the online journal.) 
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Figure 15. Success rate of the Pi of the Sky pipeline in recovering simulated 
transients of various magnitudes. The figure reflects the success rate in 
recovering transients added to the observed fields, and does not include 
efficiency lost due to observing only a fraction of the possible source locations. 
(A color version of this figure is available in the online journal.) 


7. DISCUSSION AND SUMMARY 

This paper describes the first end-to-end searches for optical 
transients associated with GW candidate events. Unfortunately, 
no convincing transient counterpart was found. This effort in- 
cluded a range of different types of telescopes, as well as a 
range of different analysis strategies. While the variety of anal- 
ysis strategies employed presents a challenge for interpretating 
the results, we believe that this approach is forward-looking. 
The LIGO and Virgo collaborations have recently made an open 
call for partners to search for EM counterparts to GW events 
discovered with the next generation of GW detectors. 162 It is 
likely that partners will use a variety of facilities and instru- 
ments, and each apply their own data analysis techniques. Both 
the successes and lessons of this work should serve as useful 
guideposts to investigators pursuing similar searches with the 
up-coming “advanced” generation of GW detectors. Strategies 
are also being discussed in the literature (Metzger & Berger 
2012; Nissanke et al. 2013; Singer et al. 2013). 

Rapidly down-selecting candidates for follow-up observa- 
tions, integrating results for astrophysical interpretation, and 
communicating findings will require a common framework to 
describe transients discovered with disparate techniques. In this 
work, we presented two complementary statistics for character- 
izing the results of a transient classification pipeline, the FAP 
and the detection efficiency. These statistics were calculated 
for several different analyses, so that objects discovered in the 
searches could be quantitatively evaluated and compared. This 
paradigm, where results from transient searches with different 
selection criteria must be discussed in a common language, is 
likely to be a theme that becomes more common as survey 
instruments evolve. 

Because GW event candidates are poorly localized, searches 
for counterparts need to consider the large population of 
optically variable sources that could produce false positive 
coincidences (Kulkarni & Kasliwal 2009). Classification based 
on light curves, spectroscopy and other properties can help, but 
these strategies are complicated by the fact that the light curves 


162 http://www.ligo.org/science/GWEMalerts.php 


associated with compact object mergers and other potential 
GW sources are largely uncertain. However, we were able to 
demonstrate several automated strategies that reduced false 
positives, while selecting for a wide range of models. These 
techniques included demands on the rate of dimming in objects, 
spatial coincidence with galaxies within the GW observable 
distance, anti-coincidence with cataloged stars and asteroids, 
and shapes consistent with point objects. For a variety of data 
sets over wide areas, we showed how these cuts could be 
applied to reduce the rate of false positives to less than 10%, 
meaning that a single telescope taking a series of images in 
response to a GW trigger would have less than a 10% chance 
of reporting a false positive. Monte Carlo simulations of model 
light curves were used to show that this false-positive rejection 
was possible while still maintaining sensitivity to models of 
both GRB afterglow light curves and kilonova light curves. 

Follow-up observations of the type presented in this paper 
will probably be just the first stage in efforts to find Advanced 
LIGO/ Virgo counterparts. While essential to identify candidate 
counterparts, wide field imaging is unlikely to be sufficient 
to make definitive associations with a GW trigger. Further 
observations, including sensitive photometry and spectroscopy, 
will be needed to confirm possible associations and characterize 
the source. The level of false-positive rejection achieved by 
software in this work, if promptly applied to collected optical 
image data, would reduce candidate objects associated with a 
LIGO/ Virgo trigger to a manageable level, such that they could 
be pursued with further follow-up observations. The challenge 
presented by false positives is likely to increase with the advent 
of Advanced LIGO/Virgo, when a larger horizon distance will 
require imaging to fainter magnitudes, and so increase the 
number of potential contaminants. 

The Monte Carlo studies we performed demonstrated that we 
typically recovered a range of light curve models to a depth 
consistent with the limiting flux of the observations, proving the 
validity of our selection criteria. During the observing periods, 
typical position averaged sensitive ranges for NS-NS mergers 
was 18 Mpc, or 35 Mpc for NS-BH mergers (Abadie et al. 
2012e). The efficiency curves shown in Figures 9 through 
15 show that the data sets with better limiting magnitudes 
(QUEST, Liverpool Telescope) were successful in recovering 
all the considered models at these distance scales. The less 
sensitive data sets (ROTSE, TAROT, Zadko) would have missed 
a kilonova at these distances, but were potentially sensitive to 
GRB afterglows. Looking toward the future, the simulation 
results show that short exposures (~1 minute) with small 
aperture telescopes, with observations to depths of less than 
18th magnitude, failed to recover short GRB or kilonova light 
curves at distances comparable to the expected 200 Mpc range of 
advanced GW detectors to NS-NS mergers (Abadie et al. 2010b; 
Aasi et al. 2013). This means that, while smaller telescopes may 
be valuable in searching for counterparts to galactic GW sources, 
they may require long total exposures, and/or a hierarchical 
observing strategy with larger telescopes, to be able to detect 
the expected optical signature of distant compact object mergers. 
Another factor that is likely to impact transient recovery in 
the advanced detector era is the incompleteness of available 
galaxy catalogs (Nissanke et al. 2013). Currently, catalogs 
are missing a significant fraction of the extragalactic starlight 
within 200 Mpc, however, planned surveys can help address this 
problem (Nissanke et al. 2013). 

This study has been a valuable exercise that will help the 
preparation of the data analysis and observing strategies for 
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the up-coming second generation GW detectors, which are 
anticipated to begin operating in 2015 and to improve in 
sensitivity over the following few years (Aasi et al. 2013). 
Searches for optical and other transient counterparts will become 
even more compelling as the range of the detectors increases. 
Moreover, the rapid growth of large area survey instruments, 
including plans for the Large Synoptic Survey Telescope (Ivezic 
et al. 2011), means that the problem of choosing among rapidly 
fading candidates selected with different criteria is likely to 
become a theme that extends beyond GW related searches. The 
LIGO and Virgo collaborations are committed to providing 
prompt triggers for astronomers to follow up, with a more 
open model to allow broader participation (LIGO Scientific 
Collaboration & Virgo Collaboration 2012). We can therefore 
hope that future searches will yield multi-messenger transient 
events that reveal the astrophysical sources and processes that 
produce them. 
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our Figures 4-7. 
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